Assessment and Comparison of Physical Fault Injection Techniques List of Publications

نویسندگان

  • PETER M. FOLKESSON
  • Jan Torin
چکیده

This thesis deals with the problem of validating and estimating the effectiveness of error handling mechanisms in computer systems. The main contribution is an assessment of the effectiveness and usefulness of several physical fault injection techniques. The assessment is based on fault injection experiments conducted on the fault-tolerant, distributed, real-time system MARS and the Thor microprocessor. Another key contribution is the validation of the error handling mechanisms included in these systems. The MARS system was evaluated using heavy-ion radiation, electromagnetic interference and pin level fault injection to allow, for the first time, a direct comparison of physical fault injection techniques. Significant differences in the results obtained by the techniques were observed. The results also showed that hardware based error detection mechanisms are the most effective mechanisms of MARS, but that application level mechanisms can significantly improve the error detection coverage. The thesis introduces scan-chain implemented fault injection (SCIFI), which provides higher observability and controllability than most other physical fault injection techniques. The SCIFI technique injects faults via the test access port of integrated circuits. Results of SCIFI experiments on the Thor microprocessor are compared with results of simulation based fault injection performed using a highly detailed VHDL model of Thor. The comparison show that the SCIFI technique can be more than 100 times faster than simulation based fault injection, and yet produce similar results. Additional SCIFI experiments on Thor show that the estimated error coverage may vary by more than five percentage units for different workload input sequences. A methodology for predicting the error coverage for various input sequences based on fault injection experiments with a specific input sequence is presented. Although the accuracy of the predicted values is limited, the methodology is able to find input sequences with high, medium or low error coverage.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Three Physical Fault Injection Techniques to the Experimental Assessment of the MARS Architecture Johan Karlsson

This paper describes and compares three physical fault injection techniques—heavy-ion radiation, pin-level injection, and electromagnetic interference—and their use in the validation of MARS, a fault-tolerant distributed real-time system. The main features of the injection techniques are first summarized, and then the MARS system is described. The distributed testbed setup and the common test s...

متن کامل

Comparison and Integration of Three Diverse Physical Fault Injection Techniques

— pin-level fault injection, heavy-ion radiation, and electromagnetic interference (EMI) — and their use in the validation of MARS, a faulttolerant distributed real-time system. The objectives of this study are: (i) to validate the fault-tolerance features of the MARS system, and (ii) to gain a better understanding of the features and impact of the three fault injection techniques. Coverage mea...

متن کامل

Integration and Comparison of Three Physical Fault Injection Techniques

This paper describes and compares three physical fault injection techniques—heavy-ion radiation, pin-level injection, and electromagnetic interference—and their use in the validation of MARS, a fault-tolerant distributed real-time system. The main features of the injection techniques are first summarised and analysed, and then the MARS error detection mechanisms are described. The distributed t...

متن کامل

Comparison of Physical and Software-Implemented Fault Injection Techniques

This paper addresses the issue of characterizing the respective impact of fault injection techniques. Three physical techniques and one software-implemented technique that have been used to assess the fault tolerance features of the MARS faulttolerant distributed real-time system are compared and analyzed. After a short summary of the fault tolerance features of the MARS architecture and especi...

متن کامل

Stability Assessment Metamorphic Approach (SAMA) for Effective Scheduling based on Fault Tolerance in Computational Grid

Grid Computing allows coordinated and controlled resource sharing and problem solving in multi-institutional, dynamic virtual organizations. Moreover, fault tolerance and task scheduling is an important issue for large scale computational grid because of its unreliable nature of grid resources. Commonly exploited techniques to realize fault tolerance is periodic Checkpointing that periodically ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999